Distinguishability, Ensemble Steering, and the No-Signaling Principle 
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We consider a fundamental operational task, distinguishing systems in different states, in the 
framework of generalized probabilistic theories and provide a general formalism of the task. Op- 
timality conditions are explicitly shown. With the formalism established, we show that for any 
generalized probabilistic theories where the ensemble steering is allowed, the no-signaling princi- 
ple can tightly determine the distinguishability of states, independently to how their state spaces 
are structured. This shows that distinguishability is generally characterized as a consequence that 
ensemble steering on states and the non-signaling constraint on probabilities are compromised. 
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One of fundamental aspects of the quantum formal- 
ism is that both states and probabilities are needed to 
characterize given systems before and after measurement, 
respectively. This is connected to how quantum sys- 
tems achieve practical advantages in information pro- 
cessing. For instance, nonlocal quantum probabilities 
[l[ are resources for secure communication Q. Interest- 
ingly, quantum systems cannot attain correlations that 
are maximally non-local in non-signaling frameworks 
0. Information-theoretic characterizations to quantum 
probabilities, such as the information causality as a way 
of generalizing the no-signaling condition [3l pl @ j eluci- 
date strengths and limitations of quantum systems, show- 
ing deep connections to quantum foundations and infor- 
mation applications. 

On the other hand, the formalism of quantum states 
and their dynamics, resources for computational and in- 
formation processing, are highly non-trivial among non- 
signaling theories in the sense that entanglement is con- 
tained [ll- Contrast to the case of quantum proba- 
bilities, capabilities of quantum states have been tightly 
constrained and guided by the no-signaling principle. 
Namely, assuming entanglement of two parties, the other 
party's better cloning of quantum states Q or more effi- 
cient quantum dynamics [lO| than what can be described 
within quantum theory are excluded by the no-signaling 
condition: by the course of such better performances, 
better distinguishability that would establish a fastcr- 
than-light communication can be achieved [ll|. 

From the aforc-mcntioned results, we identify the dis- 
tinguishability, i.e. distinguishing systems in different 
states, as the crucial task that derives limitations to other 
operational tasks. Note also that the distinguishability is 
generally a fundamental building block for system iden- 
tifications and practical applications e.g. it determines 
how efficiently information can be transferred 12|, 



or 



provides quantification of the universally composable se- 
curity in cryptographic applications e.g. Rcfs. 13 1 [l^ . 



The present work is motivated to consider the under- 
lying state space in which the distinguishabilty is tightly 



constrained by the no-signaling principle: Is it uniquely 
the space of quantum states? If not, how general is it that 
the distinguishability is determined generically by the no- 
signaling principle? To this end, we consider the distin- 
guishability in generalized probabilistic theories (GPTs) 
ISMlTjl where states, measurement, and their relations 
are extended to cover other non-signaling theories. We 
first develop a theoretical tool for the purpose, a gen- 
eral method of distinguishing states with minimal errors 
in GPTs. With the established method, we show that 
the distinguishability is a quantity independent to non- 
locality: the distinguishably is not enhanced as a theory 
contains more, or less, non-locality in its bipartite exten- 
sion. Then, we show that for GPTs where the ensemble 
steering is allowed, the distinguishability is immediately 
determined by the no-signaling constraint, regardless of 
how their state spaces are structured. This shows that 
the distinguishability in GPTs is generally found as a 
consequence that ensemble steering on states and the no- 
signaling condition on probabilities are compromised. 

We first begin with the framework of GPTs [l^, ex- 
ploiting the mathematical formalism shown in Ref. jTij . 
sec also Ref. [l3l- The set of states, denoted by 
57, consists of all possible states that a system can be 
prepared in. Any probabilistic mixture of states, i.e. 
pwi + {1 — p)'W2 G 57 for 'Wi,W2 € 51 and probability 
p is also a state, and thus the set is convex. A general 
mapping from states to probabilities is described by ef- 
fects, linear functionals O — > [0, 1] : a measurement s 
is denoted by a set of effects, E^'^^ ~ {ex^''}^i, and the 
probability of getting outcome x for measurement s when 
state w is given is, p(x|s) = ei^^[u']. A unit effect u de- 
notes that a measurement is certain to occur, that is, 
u[w] = 1 for all w e 51, so that for any measurement s, it 

holds J2x ^'f*^ ~ effects arc dual to the state space, 

they are also convex. 

The distinguishability of states in GPT can be de- 
scribed as a game of two parties, Alice and Bob. They 
have agreed a set of states in advance, and Alice prepares 
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a system in one of N states with some probabihty and 
gives it to Bob. If Bob makes a correct guess, their score 
is given 1, otherwise 0. The goal is then to maximize the 
average score over possible measurements, which defines 
minimum-error state discrimination. 

Suppose that states {wx}^^i and prior probabilities 
{Qx}^=i that Alice generates are in agreement, which al- 
together are written by {^x, Wxl^i- Bob has to find opti- 
mal measurement {ex}^i fulfilling Cx = u, such that 
he makes guesses from each effect e^- Let PB\A{^\y) = 
exfwy] denote the probability that Bob makes a guess Wx 
when state Wy is given by Alice. The task is to maximize 
the guessing probability in the following, 

N N 

Pgucss max^ (7xPb|a(x|x) = max ^ ^xCx [wx] (1) 

x=l x=l 

where the maximization runs over all effects. Note that 
GPTs are generally not self-dual, meaning an isomor- 
phism between two spaces does not exist in general [isj , 
and thus state and effect spaces are generally distinct. 

The sole fact that state and effect spaces are convex 
allows us to formalize the discrimination problem in the 
convex optimization framework fioj . This in fact pro- 
vides a general approach of finding optimal discrimina- 
tion in GPTs. For states {q^, Wxj^i, we take the form in 
Eq. ([T} as the primal problem denoted by p* and derive 
its dual d* , as follows, 

TV N 

p* = max{y^ Qxe^iwy:] || ex > Vx, ^e^^u} (2) 

x=l x=l 

d* = min{u[/i] || K > q^w^, x = 1, • • • ,iV} (3) 

where inequalities mean the order relation in the convex 
set: by Cx > 0, it is meant that e^lw] > for all w G fi, 
and by if > gx^w, that e[K — qxWx] > for all effects e. 

The property called the strong duality holds true in the 
above, meaning that both solutions from primal and dual 
problems are equal, i.e. p* = d*. This is obtained from 
the so-called Slater's constraint quantification in convex 
optimization: a sufficient condition for the strong dual- 
ity can be the strict feasibility, that is, the existence of 
a strictly feasible point of parameters. For instance, pri- 
mal parameters {cx = u/N}^^i are in the case, since 
ex[wy] > Vx, y and ^^e^ — 1. From this, it is shown 
that the guessing probability can be obtained from either 
the primal or the dual problem. 

In another approach in convex optimization, called 
complementarity problem, optimality conditions of a 
given optimization problem are directly analyzed. This 
deals with both primal and dual parameters in Eqs. ([2]) 
and ([3]), and therefore is not considered more efficient 
in numerics. The advantage then lies at the fact that 
generic structures existing in the problem are exploited. 
The optimality conditions can be summarized by the so- 
called Karush-Kuhn- Tucker (KKT) conditions. They are 



constraints listed in Eqs. ^ and (j3]), together with the 
foUowings, 

(Symmetry parameter) K = q^w^ -\- r^d^, V x (4) 

(Orthogonality) e^lr^dx] = 0, V x, (5) 

where r^ G [0, 1] for all x, and {rfxj^i which we call 
complementary states are normalized, i.e. u[dx\ = 1- 
The first condition, symmetry parameter, follows from 
the Lagrangian stability and shows that for any discrim- 
ination problem e.g. {fe, u^xj^i, there exists a single 
parameter K which is decomposed into N different ways 
with given states and complementary states {tx, dx}x=i- 
Then, the second condition in Eq. ([5]) from the comple- 
mentary slackness characterizes optimal effects by the or- 
thogonality relation between complementary states and 
optimal effects. These generalize optimality conditions 
for quantum states in Refs. (lH to all GPTs, see 

also different forms of optimality conditions [2^ . 

Primal and dual parameters satisfying the KKT condi- 
tions are automatically optimal parameters by which so- 
lutions are obtained in the primal and the dual problems. 
Moreover, for the problem that we consider here, they 
give the same solution to both problems as the strong 
duality holds. Conversely, the fact that the strong du- 
ality holds in Eqs. ([2]) and ([3]) implies the existence of 
optimal parameters which satisfy KKT conditions and 
give the guessing probability in Eq. ([1]). All these fol- 
low from the property that state spaces in GPTs forms 
convex. 

GoUecting all these, we now formalize a geomet- 
ric method of solving minimum-error discrimination in 
GPTs. We first remark that, in optimality conditions 
in Eqs. ([4]) and ([5]), constraints for states and effects 
are separated. The symmetry parameter K is charac- 
terized on a state space and gives the guessing prob- 
abihty, see Eq. ([U, i.e. Pgucss = u[K] = q^ + r^. 
Then, the guessing probability can be found by search- 
ing complementary states {rx,dx}^=i fulfilhng Eq. (j4|) 
on the state space. This can be described in a system- 
atic way, as follows. Let us define a polytope denoted by 
'P{{qx,Wx}x=i) of given states in the state space: each 
vertex of the polytope corresponds to unnormalized state 
qxWx for X = 1, • • • ,N. Then, the polytope of comple- 
mentary states, V{{rj^,dx}^^i), is in fact immediately 
congruent to V{{qx, Wx}^i) in the state space: from Eq. 
(|4]) the following holds, 

qxWx - qyWy = rydy - r^d^, for aU x, y, (6) 

which shows that corresponding lines of two polytopes 
Viiqx.Wxj^^i) and P({'^x, rfx}^=i) are of equal lengths 
and anti-parallel. Then, from the underlying geometry 
of the state space, one can find complementary states by 
putting two congruent polytopes such that the condition 
in Eq. Q holds. Optimal effects can be found from the 
orthogonal relation in Eq. ([5]), accordingly. 
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When prior probabilities are equal i.e. = 1/N for 
all X, the guessing probability is in a simpler form. First, 
we have Tx = ry for all x, y: this is obtained from the 
expression Pguess = <?x + ''x for any x. Denoted by r := Tx 
for all X, the guessing probability is characterized as, 



Pguess = + "^i*^ ^ = 



N ' 



(7) 



where the expression of r follows from the condition in 
Eq. © with a distance measure || • || that can be defined 
in the state space. The parameter r has a meaning as the 
ratio between two polytopes, P({l/iV, Wx}^i) of given 
states, and V{{d^}^^^) of only complementary states. 

We illustrate the general method of state discrimina- 
tion in GPTs, with an example called the polygon sys- 



tems shown in Refs. [17| [18|. We consider the case 



of four states, which is of particular interest as its bi- 
partite non-signaling extension can show the maximally 
non-local correlations. Four states {wxlx^i and measure- 
ment {i?''''-'}x=i with E^^^ ~ {^'o^ I'^'i^} E^re given as 



Wx 



/ 


cos27rx/4 \ 


( 


yj COS 7r/4 


(x) 1 

'^0 =2 




sin 27rx/4 




yj cos 7T / 4 


V 


1 1 


V 



yj COS TT / 4 

sin(2x-l)7r/4 



cos 7T / 4 

1 



(8) 



and 



' , where the unit effect u = (0,0, 1)-^, 
with the Euclidean inner product for p{a\x) = e^\w]. 

For four states {1/4, u'x}x=i, the goal is now to find the 
guessing probability and optimal measurement {ex}x=i- 
Exploiting the underlying geometry (cf. see Fig 2. in Ref. 
[l8|), the polytope V {{1/ A, Wx}t=i) forms a rectangle, 
from which it follows that r = 1/4 from Eq. ([7]). To be 
precise, from the state space geometry, one can see that 



K=-w, 



-Wx+2, for X = 1,2,3,4, mod 4, (9) 



and thus, Pgucss = 1/2. It is also shown that comple- 
mentary states are {r^ = 1/4, dx = u'x+2}x=i- Note 
that these four states are analogous to cases in quantum 
theory, pairs of orthogonal states: for the four quantum 
states, the guessing probability is also given by 1/2 p^ . 
This shows that distinguishability in GPTs may not be 
enhanced in the way that more non-locality is contained 
in non-signaling theories. 

Optimal measurement can be found as follows. Putting 
Cx = eg in Eq. (|5]) for x = 1, • • • ,4, the followings give 
the guessing probability: i) {ex/2}x=i, ii) {61,63}, or 
iii) {62,64}. In the case ii), effect on 61 (63) means 
that given state is either wi or {w2 or ws), since 
61 [^2] = 61 [wa] = 0. Once effect on 61 (63) is given, one 
randomly guesses either wi or 104 {w2 or w^), and the 
guessing probability is obtained 1/2. The case iii) works 
in a similar way. This shows that features in quantum 
cases are shared among GPTs: i) optimal measurement is 



generally not unique [12| . and ii) no- measurement some- 
times gives an optimal strategy [i^l (23j . 

We now move to bipartite extensions in GPTs. The ex- 
tension is specified in an operational way that the ensem- 
ble steering is allowed, i.e. in a certain way, any decom- 
position of Bob's ensemble can be steered by Alice. In 
quantum theory, this was firstly asserted by Schrodinger 
24| and then, with specification to a bipartite Hilbert 
space, formalized as the so-called Gisin-Hughston-Jozsa- 
Wootters theorem [25|. Note that, however, GPTs en- 
dowed with the ensemble steering do not yet single out 
quantum theory [2^. We also distinguish the extension 
from the purification lemma which can characterize quan- 
tum theory 27 1 . 

In what follows, applying the theoretical tool devel- 
oped so far for state discrimination, we show that for 
any GPTs endowed with the ensemble steering, no mat- 
ter how the state space is structured, the optimal dis- 
tinguishability is immediately determined by a way of 
excluding superluminal communication. In other words, 
in such theories, state discrimination that works better 
than what is predicted within a given theory itself, would 
contradict to the no-signaling principle. We first show a 
bound to the optimal state discrimination from the non- 
signaling condition, and then prove that the bound is 
indeed tight i.e. which can be achieved within GPTs. 

We first put the task of state discrimination into a 
non-signaling framework: Bob's distinguishing different 
decompositions of an identical ensemble steered by Al- 
ice at a distance. For states {(/x, u'x}^i to discriminate 
among, suppose that Alice is allowed to steer an ensem- 



ble of Bob in N different decompositions, wb '■= w 
x = I,-- - ,iV 



for 



^ = PxW^x + (1 - Px)cx, with (7x 



Px 



Ex' = lPx' 



(10) 



with some states {cx};^i and probabilities {px}^i- By 
the ensemble steering, it is meant that any decomposi- 
tion of an ensemble can be prepared by Alice. Since it 
is an identical ensemble, Bob's measurement gains no 
knowledge about which decomposition is given to him 
until Alice announces about her steering. In this way, a 
non-signaling scenario is naturally defined. 

Let us briefly sketch how to constrain state discrimina- 
tion using the non-signaling scenario. Suppose that Bob, 
nevertheless, attempts to distinguish decompositions, to 
guess about Alice's steering. The strategy exploits dis- 
crimination among those states {wx}^i m the ensem- 
ble: optimizing measurement for the discrimination and 
then getting outcome x, he finds that corresponding state 
Wx exists in the ensemble and concludes that Alice has 
steered the decomposition see Eq. (fTO|). In this way, 
state discrimination among states {wx}i^i in GPTs can- 
not work arbitrarily well, and consequently the guessing 
probability must be limited. 
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An upper bound to the guessing probabiUty is expUc- 
itly derived as fohows. First, the no-signahng condition 
appUes to Bob's guessing on average about Ahce's steer- 
ing. Let ^B|A(x|y) denote the probabihty of Bob's con- 
cluding X about Alice's steering y after any strategies of 
him, for which it holds that 



N 

J2PB\Ai^\^)<l, 



(11) 



from the no-signaling condition [17| [28[, see also Ap- 
pendix II. If this is not fulfilled, one can expli citly 
construct a superluminal communication protocol [ll|. 
Next, the strategy of Bob's guessing Alice's steering op- 
timizes measurement for the discrimination among states 
{wx}x=it so that outcome x implies existence of state Wx 
in the ensemble from which Alice's steering is concluded 
as the ensemble ui^-* . If Alice has steered Bob's ensem- 



ble in Eq. pU]) . Bob's correct conclusion happens 
when i) w-x_ is given, which appears with probability px, 
and ii) measurement gives a correct answer: this is with 
probability PxPb|a(x|x). In the strategy, there can be 
contribution in measurement from the other state Cx in 
the ensemble with probability 1 — p^. Thus, it holds, 
PxPb|a(x|x) < Pb|^(x|x). In addition, recall that the 
discrimination is optimized for {qx,Wx}^=i, as the a pri- 
ori probability for among states {lUxj^i is shown in 
Eq. (fTU|) . From the no-signaling condition in Eq. (ITU) , 
we have '^^=1 P^Pb\a{^\^) 1£ Ii from which wc have 



Pgucss = maxy^gxPB|A(x|x) < 



1 



Pi 



-PN 



(12) 



Thus, a non-signaling bound to the guessing probability 
in GPTs is obtained with ensemble-steering parameters. 

The bound is indeed tight, i.e. which can be achieved 
within a given GPT. This can be shown by proving that, 
for any set of states {Qx, Wx}^i, optimal discrimination 
implies that both an identical ensemble in Eq. (|10p and 
effects achieving the bound in Eq. ([T^ exist. 

We first recall the existence of a symmetry parameter 
K that gives the complete characterization of optimal 
discrimination, see Eq. The parameter has N de- 

compositions with complementary states {?'x,dx}^i- Its 
normalization K = K/u[K] is then, for each x, 

K ^PxWx + (1 -Px)dx, with px = qx/u[K]. (13) 

This corresponds to the ensemble steering in Eq. ([TU|) . 
Recall the dual problem in Eq. ^ which gives the guess- 
ing probability in GPTs, as Pguoss = u[-?4r]. Using the 
identity ^x=i1^ ~ 1 ^^'^ relation px ~ qx/u[K], 
the solution in the dual problem can be computed as, 
u{K) = {J2x=iP^)^^ ■ This shows that the bound in Eq. 
(fT2|) can be achieved within given GPTs, and hence the 
tightness is shown. In addition, optimal effects are char- 
acterized with complementary states, sec Eq. ([5]). 



In conclusion, we have developed and established a 
general method of finding optimal discrimination among 
states in GPTs, exploiting underlying geometry given in 
state spaces. This generalizes i) the geom etric formu- 
lation of quantum state discrimination 123 , and ii) opti- 
mality conditions in quantum cases [l^ [21| . The method 
is then illustrated in a GPT, with the four-state polygon 
system shown in Refs. [l3| This is of particular 

interest as its bipartite extension can achieve the max- 
imally non-local correlations Q. Applying the general 
method of optimal discrimination to the example, it is 
shown that the guessing probability is equal to its analo- 
gous case in quantum theory i.e. discrimination of pairs 
of orthogonal states 22] . This clarifies that operational 
tasks based on the distinguishability and the non-locality 
are independent resources: the distinguishability may not 
be enhanced or decreased as GPTs contain more non- 
local correlations. From the example, we also remark 
that for state discrimination in GPTs, i) optimal mea- 
surement is generally not unique and ii) no-measurement 
sometimes give an optimal strategy. All these have been 
known previously in quantum cases 



m [231 



Then, we have shown that, for GPTs where the ensem- 
ble steering is allowed, no matter how their state spaces 
are constructed or structured, the distinguishability is de- 
termined by the no-signaling constraint. This is shown by 
proving that the upper bound derived by the no-signaling 
condition can always be achieved within given GPTs. For 
such theories, the distinguishability lies at the border in 
which ensemble steering on states and the no-signaling 
condition on their probabilities are compromised. This 
means that the tight relation between the no-signaling 
principle and operational tasks is a feature existing not 
only in quantum theory but, generally in non-signaling 
theories. 

Finally, discrimination of states is a task applying a 
single measurement setting, and the present work shows 
that non-signaling probabilities for optimal discrimina- 
tion cannot dictate that the underlying state space is 
quantum-mechanical. If, in some other way, quantum 
states are found to be unique to describe systems among 
other non-signaling theories, it is likely that more mea- 
surement settings arc applied, although non-locality tests 
themselves are not tightly related to the no-signaling con- 
dition. We here leave it open to find such a task - this 
would be the case that, as unique description of systems, 
quantum states are the physical entity behind an in- 
formation processing, together with dimension witnesses 
[iot . The presented method of finding optimal discrimi- 
nation in GPTs is a useful theoretical tool to investigate 
information capabilities and properties of GPTs, e.g., us- 
ing models shown in Ref. |18l |. 

The author thanks G. Chiribella, D. Gross, M. 
Navascues, S. Wehner, and S. Yun for helpful discussions 
and comments. This work is supported by National Re- 
search Foundation and Ministry of Education, Singapore. 
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problem to maximize the success probability can be writ- 
ten as the primal problem as, 
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Appendix I. Convex optimization framework 

We show the derivation of primal and dual problems 
for optimal state discrimination in GPTs. Recall that 
both state and effect spaces are convex. For convenience, 
we follow the formalism and notations in Ref. um- The 



mm /({e.}^=i) 



N 

E 



gxex[wx] 



subject to ex > Vx, = u, 

X 

where by > it is meant that e^[w] > for all w G 
il. Note that the above problem is feasible as the set of 
parameters satisfying constraints is not empty. It is also 
strictly feasible with parameters with ex = u/N for all x. 

To derive the dual problem, the Lagrangian can be 
constructed as, 

X 

X 

with {dx > 0}^]^ and K are dual parameters. The dual 
problem is then obtained by solving the following, 

g({rf^i,A') = min/:({ex}f=i,{dx}li, A-), (14) 
for which the Lagrangian can be further evaluated as, 
^({ex}^=i,K}f=i,^') = ^ex[i^-gxWx-dx]-"[if]. 

X 

Then, the minimization in Eq. ([T4)) is to be, — u[ii'] if 
K — q^Wx + rfx for all x, otherwise — oo. Thus, we have 
= K ~ q^Wy^ for each x. Since dy^>Q i.e. e[d^ > for 
all effects e, we write this by, K > Qx^J^'x for each x. The 
dual problem is therefore as follows. 

max ~u{K) (or, min 
subject to K > q^w^ V x. 

The inequality means an order relation in a convex cone, 
which is determined by effects, i.e. e[K—qxWx] > for all 
effects e. Note also that the dual problem is also strictly 
feasible, with K ~ X^x^x^x- 



Appendix II. No-signaling condition in Eq. (|11[) 

We recall the non-signaling scenario exploited when 
limiting the guessing probability. It is supposed that Al- 
ice can steer Bob's ensemble with N decompositions, la- 
beled by X = 1, • • • , iV. This does not modify the ensem- 
ble average itself of Bob, but only its decompositions. 
Bob therefore gains no knowledge by all means about 
which decomposition is given, unless Alice announces her 
choice about the ensemble steering. In what follows, we 
explain the no-signaling condition in Eq. (|lip in detail. 



6 



Let PAB{ci,b\-A^,B) denote the joint probability distri- 
bution of Alice and Bob. To relate this to the scenario, 
we note that A corresponds to Alice's choice in the steer- 
ing, to prepare the decomposition u;^^ in Eq. PU)) for 
Bob. Then, Bob performs measurement according to her 
choice B and makes a guess about the steered decomposi- 
tion from outcome b. Recall the no-si gna ling condition on 
the joint probability shown in Refs. [17[ [28[, as follows, 
for all X 7^ x' 

J2PAB{a,b\A^^,B)=Y,PAB{a,b\A = ^\B). (15) 



this, the no-signaling condition in Eq. (|15|) is expressed 
as, PB\A{b\A,B) = PB\A{b\A' ,B). Suppose that for opti- 
mal measurement B, it holds that 



N 



^Ps|a(& = x|^ = x,6) > 1 



(16) 



Note that generally Bob's measurement may not be com- 
plete: X;f=i PsiAib = x\A = x',6) < 1 for aU x'. Thus, 
we arrive at the following: 



This means that Alice's choice of steering A docs not 
affect to Bob's statistics, i.e. Bob's measurement-and- 
outcomc relations. This is clear from the ensemble inter- 
pretation followed from the assumption of ensemble steer- 
ing, together with what we have assumed that Bob holds 
an identical ensemble whose decompositions are steered. 

We derive the condition in Eq. ([TT|) by contradiction. 
For convenience, we write Bob's probability for Alice's 
steering as, Pb|a(&|^,6) = T,a PAB{a,b\A, B). With 



N 



^(Pb|^(x|x,6)-Pb|a(x|x',S)) >0 

x=l 

=^ 3x,x' s.t. Pb|^(x|x,6) >Pb|^(x|x',6),(17) 

which contradicts to the no-signaling condition in Eq. 
([T5|). Or, with probabilities in Eq. p7)) . one can also con- 



struct a faster-than-light communication protocol [H 
Thus, the no-signaling condition in Eq. (jlip is shown. 



